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Abstract 

We study the discrete memoryless Z-interference channel (ZIC) where the transmitter of the pair that 
g i suffers from interference is cognitive. We first provide upper and lower bounds on the capacity of this channel. 

£Sj ■ We then show that, when the channel of the transmitter-receiver pair that does not face interference is noiseless, 

qq ' the two bounds coincide and therefore yield the capacity region. The obtained results imply that, unlike in the 

Gaussian cognitive ZIC, in the considered channel superposition encoding at the non-cognitive transmitter as 
well as Gel'fand-Pinsker encoding at the cognitive transmitter are needed in order to minimize the impact of 
interference. As a byproduct of the obtained capacity region, we obtain the capacity result for a generalized 
Gel'fand-Pinsker problem. 



Index terms: Cognitive interference channel, capacity region, Z-interference channel, one-sided inter- 
ference channel 
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I. Introduction 

The interference channel (IC) [1] is a simple network consisting of two transmitter-receiver pairs. 
Each pair wishes to reliably communicate at a certain rate, however, the two communications interfere 
with each other. A key issue in such scenarios then, is how to handle the interference introduced by 
the simultaneous transmissions. This issue is not yet fully understood, and the problem of finding the 
capacity region of the IC remains open, except in special cases [2]— [12] . For a tutorial on the capacity 
results of the IC, see [13]. The Z- interference channel (ZIC) is an IC where one transmitter-receiver 
pair is interference-free. Although this is a simpler channel model than the IC, capacity results are 
still known only in special cases [6, Section IV], [14]— [16]. 

In certain communication scenarios, such as cognitive radio networks, some transmitters are cog- 
nitive, i.e., are able to sense the environment and thus obtain side information about transmissions 
in their vicinity. Perhaps due to the exciting promise of the cognitive radio technology to improve 
the bandwidth utilization and thus allow for new wireless services and a higher quality of service, 
the IC with one cognitive transmitter has been studied extensively [17]-[24]. Related channel models 
were also analyzed in [25], [26]. In the model considered in [17]— [25], it is assumed that due to the 
cognitive capabilities, the cognitive encoder noncausally obtains the full message of the non-cognitive 
transmitter. While this is a somewhat idealistic view of cognition in a wireless network, this model 
applies for example, to scenarios where the cognitive transmitter is a base station. Then, it can obtain 
side information via backhaul (high-capacity link such as an optical cable). This side information 
then enables interference reduction [27] by precoding at the cognitive encoder. Furthermore, it enables 
cooperation with the non-cognitive pair. In fact, one of the main difficulties in finding the capacity 
region of the traditional IC comes from distributed encoding. IC with one cognitive transmitter enables 
one-sided transmitter cooperation, and thus allows centralized encoding to some degree. This may be 
the reason why determining the capacity region of the cognitive IC is somewhat easier than the 
traditional IC. In particular, while the capacity region of the Gaussian IC in weak interference is not 
known (the sum capacity in certain weak interference regimes has recently been found in [28]-[30]), 
the capacity region of the cognitive Gaussian IC in weak interference has been determined [19], [20]. 

In this paper, we study a ZIC where the transmitter of the pair that suffers from interference 
is cognitive (see Fig. [I]). The capacity region of such a cognitive ZIC in the Gaussian case is 
straightforward to obtain, since by using dirty -paper coding [31] at the cognitive encoder, both 
communicating pairs can achieve the interference-free, single-user rates. However, limiting the study 
of the cognitive ZIC to the Gaussian case leaves some unsatisfaction to the understanding of the 
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Fig. 1. Cognitive Z-interference channel. 



problem. Firstly, it does not provide intuition as to how the interferer's rate affects the rate of the 
cognitive transmitter-receiver pair in a general channel. Secondly, it does not provide the insight into 
the optimal codebook structure for the non-cognitive encoder, so that it minimizes interference caused 
for the cognitive pair. 

Hence, in this paper, we study a discrete memoryless cognitive ZIC. We first derive an upper bound 
on the capacity region. The technique that we use in obtaining the converse was introduced by Korner 
and Marton in [32], and was proven to be useful in the solution of several problems in multi-user 
information theory [15], [16], [32], [33], including the Gel'fand-Pinsker problem [27]. We apply this 
technique twice to obtain the upper bound on the capacity region. Next, we derive a lower bound on 
the capacity region where the non-cognitive pair uses superposition encoding to control the amount 
of interference it causes for the cognitive pair. Unlike in the IC, this encoding approach has not been 
applied in the cognitive IC literature, with the exception of concurrent and independent work [23]. 
Finally, we show that the lower and upper bounds meet when the channel between the non-cognitive 
pair is noiseless. We denote this channel model as the cognitive ZIC with one noiseless component. 
From the capacity results, we conclude that it is optimal for the interference-causing (non-cognitive) 
pair to use superposition encoding; the inner codeword is decoded by the receiver of the cognitive pair 
while Gel'fand-Pinsker coding is performed against the outer codeword at the cognitive transmitter. 

The capacity region of the discrete memoryless cognitive IC is known in some special cases [18], 
[19], [23]. The tight result we derive in this paper does not fall into these special cases, as explained 
in more details in Section |V] Furthermore, the cognitive ZIC with one noiseless component is the first 
channel model for which superposition encoding at the non-cognitive transmitter is not only required 
but also optimal. 

Note that, in general, the capacity region of the traditional ZIC in which the interference-free 
transmitter-receiver pair is noiseless, is unknown. The most we know about this scenario is the sum 
capacity [15]. Thus, the results in this paper provide yet another example where finding the capacity 
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region of the cognitive IC is easier than that of the traditional IC due to the possibility of centralized 
(joint) encoding by the cognitive transmitter. 

The considered problem is also intimately related to the Gel'fand-Pinsker (GP) problem [27] where 
a transmitter-receiver pair communicates in the presence of interference noncausally known at the 
encoder (see Fig. [2]). By viewing the non-cognitive encoder in the cognitive ZIC as a source of this 
interference, we arrive to a generalized GP problem. Instead of the state being i.i.d. as in the GP 
problem, in the generalized GP model considered in this paper, the state is uniformly distributed on 
a set of size 2 nR2 , where R 2 is a number between and the logarithm of the cardinality of the state 
space. The further generalization is that, unlike in [27], in our model one can optimize the set, i.e., 
the structure of the interference. The solution of this paper shows that the optimal interference has a 
superposition structure. 

II. System Model 

Consider a ZIC with two transition probabilities p(yi\xi, x 2 ) and p(y 2 \x 2 ). The input and output 
alphabets are X\, X 2 , and y 2 . 

Let Wi and W 2 be two independent messages uniformly distributed on {1, 2, ■ • • , Mi} and {1, 2, • ■ ■ , 
M 2 }, respectively. Transmitter % wishes to send message W{ to Receiver i, i = 1,2. Transmitter 1 is 
cognitive in the sense that, in addition to knowing W\, it knows the message W 2 . An (Mi, M 2 , n, e n ) 
code for this channel consists of a sequence of two encoding functions 

fi :{1 5 2, • ■ • , Mi} x {1,2, ■• ■ ,M 2 } — > Af x n , (1) 

/ 2 n :{l,2,.-- ,M 2 }-A2\ (2) 

and two decoding functions 

g?-.y?^{l,2,--- ,Mi}, i = l,2 (3) 

with probability of error 

6n = M7M £ Pr \9?0T) ¥> *>i\Wi = w u W 2 = w 2 ] . (4) 

W\ ,102 

A rate pair (i?i,i? 2 ) is said to be achievable if there exists a sequence of (2 nRl ,2 nR2 ,n,e n ) codes 
such that e n — >• as n — >• oo. The capacity region of the cognitive ZIC is the closure of the set of all 
achievable rate pairs. 

A cognitive ZIC with one noiseless component is a cognitive ZIC where the channel between X 2 
to Y 2 is noiseless, i.e., p(y 2 \x 2 ) is a deterministic one-to-one function. 
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Fig. 2. Gel'fand-Pinsker problem. 



Throughout the paper, we use the following shorthand for random vectors: K % = Ki, 
K 2 ,--- ,Ki and K? +1 = K i+1 , K i+2 , ■■■ ,K n . 

III. Converse 

In this section, we provide an upper bound on the capacity region of the cognitive ZIC. 
Theorem 1: Achievable rate pairs R 2 ) belong to a union of rate regions given by 

R± < I^Y^V) - I(U;Y 2 \V) (5) 

R2 < I(X 2 ; Y 2 \V) + min {I(V; Y 1 ),I(V; Y 2 )} (6) 

where the union is over all probability distributions p(v, u, x 2 )p(xi\u, x 2 ) and the mutual informations 
are calculated according to distribution 

p(v,u,x 1 ,x 2 ,y 1 ,y 2 ) = p(v, u, x 2 )p(x 1 \u, x 2 )p{yi\x 1 , x 2 )p(y 2 \x 2 ). (7) 
Proof: The proof is provided in Section IVII-A1 ■ 
This converse result is obtained by using the converse technique of Korner/Marton [34, page 314] two 
times, resulting in two auxiliary random variables. 

IV. ACHIEVABILITY 

The achievability scheme uses a combination of superposition encoding at the non-cognitive encoder 
and GP encoding of the outer codeword of interference at the cognitive encoder. The performance is 
given in the following theorem. 

Theorem 2: The union of rate regions given by 

Rx< IiU-^V) - I{U;X 2 \V) (8) 
R2 < I(X 2 ; Y 2 \V) + min {/(V; Y x ), I(V; Y 2 )} (9) 

is achievable, where the union is over all probability distributions p(v, u, x 2 )p(xi\u, x 2 ) and the mutual 
informations are calculated according to the distribution in ©. 
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Proof: The proof is provided in Section IVII-Bl ■ 
Remark: The proposed achievability scheme is a special case of the independent and concurrent work 
[23, Theorem 2] by setting U 10 = V, (U n , U 10 ) = (V n , U 10 ) = X u V 20 = 0, V 22 = U, L 20 = R 20 = 0, 
L\\ = Rn and then swapping the indices of 1 and 2 because in [23], the second transmitter-receiver 
pair is cognitive. 

V. Capacity Region of the Cognitive ZIC with One Noiseless Component 

In general, the achievability results in ®-(|9]) and the converse results in ©-© do not meet, due 
to the fact that 

I(U;X 2 \V)>I(U;Y 2 \V) (10) 

because the random variables satisfy © which implies that Markov chain U — > (V, X 2 ) — > Y 2 holds. 

However, in the case where the channel output Y 2 = X 2 , the achievability results and the converse 
results meet, yielding the capacity region. More specifically, we have the following capacity results 
for the cognitive ZIC. 

Theorem 3: For cognitive ZIC with one noiseless component, i.e., p(y 2 \x 2 ) is a deterministic one- 
to-one function, the capacity region is given by the union of rate regions: 

R 1 <I{U;Y 1 \V)-I(U;X 2 \V) (11) 
i? 2 < H(X 2 \V) + min {I(V; Y t ), I(V; X 2 )} (12) 

where the union is over all probability distributions p(v, u, x 2 )p(xi\u, x 2 ) and the mutual informations 
are calculated according to the distribution in ©. 

Remark: Similar to the solution of the GP problem, one may restrict ■p(xx\u, x 2 ) to be a deterministic 
function, i.e., p(xx\u, x 2 ) only takes the values of and 1, in the union in Theorem [3l To see this, 
observe that for a fixed p(v,u,x 2 ), only (fTTj) and the term I(V;Yi) in f|T2|) depend on p(x 1 \u,x 2 ). 
The right-hand side of (flTT) can be written as 

/([/; Yx\V) - I(U; X 2 \V) = ^p(v) (I(U; Y,\V = v) - /(£/; X 2 \V = v)) (13) 

V 

which is a linear combination of convex functions of p(x±\u, x 2 ) [27, Proposition 1 (ii)]. Thus, the right- 
hand side of (fTTj) is a convex function of p{x\\u, x 2 ) and the maximum is achieved by a deterministic 
function. For the fixed p(v), I(V;Yi) is convex in p(yi\v) [35, Theorem 2.7.4], which is a linear 
function of p(x 1 \u,x 2 ) for fixed p{u,x 2 \v) and p(yi\xi, x 2 ). Thus, I(V;Yi) is a convex function of 
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p(xi\u, x 2 ) and the maximum is achieved by a deterministic function. Hence, both (flTT) and I(V; Yi) 
in (fT2l) are maximized by a deterministic p(a;i|ii, x 2 ). 

We conclude from Theorem [3] that, in the special case of noiseless channel between the interference- 
free transmitter-receiver pair, to minimize the effect of interference caused to the cognitive transmitter- 
receiver pair, the non-cognitive pair uses superposition encoding, allowing the cognitive pair to decode 
the inner codeword. In contrast to the Han-Kobayashi scheme [9] for the traditional ZIC, where 
the outer codeword of the interferer is treated as noise, here, due to the cognitive capability of the 
transmitter that faces interference, GP encoding is performed on the outer codeword to further reduce 
the effect of interference. 

The capacity region of the discrete memoryless cognitive IC is known in some special cases [18], 
[19], [23]. The cognitive ZIC with one noiseless component is not a special case of [18, Theorem 3] 
as it does not satisfy either of the two conditions of strong interference. It satisfies Assumption 3.1 
but not Assumption 3.2 in [19], and therefore its capacity region is not characterized by [19, Theorem 
3.4]. The capacity results in Theorem [3] is not a special case of [23, Theorem 5] as the received signal 
of the cognitive pair is not a deterministic function of the two channel inputs. Rather, in the cognitive 
ZIC with one noiseless component, the received signal of the non-cognitive pair is a deterministic 
function. Furthermore, it does not satisfy the mutual information inequality required in [23, Theorem 
5]. 

VI. Discussion 

In the case where Y 2 = X 2 , the cognitive ZIC problem can be seen as a form of generalized 
GP problem, where X 2 is the channel state that affects the communication between transmitter- 
receiver pair 1. This formulation generalizes the GP problem in the sense that, instead of the state 
(random parameters of the channel) being i.i.d., the state is uniformly distributed on a set of size 
2 nR2 . Furthermore, we are allowed the freedom, not only to design the codebook of the cognitive 
transmitter, but also the structure of the set where the states lie, in order to maximize the number 
of bits transmitted between the cognitive pair. We are then interested in the capacity of the cognitive 
transmitter-receiver pair, denoted as C(R 2 ), which is a function of R 2 . 

Using the capacity region for the cognitive ZIC with one noiseless component in Theorem [3j we 
see that the capacity of the cognitive pair when the state uniformly takes a value from a set of 2 nR2 
sequences is 

C{R 2 )= max I{U; Y X \V) - I{U; X 2 \V) (14) 

p(v,u,X2),p{xx\u,x 2 ) 
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where the maximum is over all distributions p(v,u,x 2 ),p(xi\u,x 2 ) that satisfy 

H(X 2 \V) + min {I(V; Y X )J{V\ X 2 )} > R 2 (15) 

Thus, in the generalized GP problem, when given the rate of the possible channel states R 2 , the optimal 
interference has a superposition structure. 

Remark: When R 2 = log \X 2 \, C(R 2 ) reduces to the GP rate where the state is i.i.d. and uniformly 
distributed on set X 2 . This can be seen as follows: first, by choosing V = <p and p(x 2 ) to be the 
uniform distribution on X 2 in the maximization of (fl4)) . we obtain the GP rate. Hence, we conclude 
that C(log \X 2 \) is no smaller than the GP rate. On the other hand, when R 2 = log \X 2 \, according to 
(fT5l) . the distribution we are allowed to maximize over in (fl4l) has to satisfy 

1) p(x 2 ) is the uniform distribution on X 2 

2) I{V-Y X )>I{V-X 2 ) 
which means 

C{\og\X 2 \)= max I(U; Y X \V) - I(U; X 2 \V) (16) 

p(v,u\x2)p(xi\u,x 2 ): I{V;Yl)>I(V;X 2 ) 

< max ■ UV-Yj+IiU-YW-mX^-IiU-X^V) (17) 

p(v,u\x2)p(x\\u,X2): I{V;Yi)>I(V;X2) 

max I(U,V;Yx) - I(U,V;X 2 ) (18) 

p(v,u\x2)p(xi \u,X2): I (V ;Yi)>I (V ;X2) 

< max I(U, V; Y t ) - I{U, V; X 2 ) (19) 

p(v ,u\x2)p{x\ \u,V,X2) 

where in (fT6l)-(fT9l). we have implicitly assumed that p(x 2 ) is the uniform distribution. By setting 
(U, V) — U in (TT9l , we see that (TT9l) is the GP rate, which means that C(log \ X 2 \) is no larger than 
the GP rate. Thus, we conclude that C(log \ X 2 \) is equal to the GP rate where the state is i.i.d. and 
uniformly distributed on set X 2 . 

VII. Proofs 

A. Proof of Theorem [7] 

Following from Fano's inequality [35], we have 

nR x < H{Y?) - H{Y^\W 1 ) + ne n (20) 
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and 

nR 2 < H{Y 2 n ) - H(Y 2 n \W 2 ) + ne n (21) 

< H(Y 2 n ) - H(Y 2 n \W 2 ,X2)+ne n (22) 

= H(Y»)-H{Y»\X»)+ne n (23) 

n 

= H(Y 2 n ) - H(Y 2t \X 2i ) + ne n (24) 



i=l 

where (12~3l follows from the Markov Chain VF 2 — * X 2 — ► l^ n , and (l24j) follows from the memoryless 
property of the channel p(y 2 \x 2 ). 

Applying the technique [34, page 314, eqn (3.34)] twice, we obtain 

n 

H{Y?) - H{Y?) = Y,H{Yii\Yt\Y?(i + i)) ~ H{Y 2i \Yt\Y% i+1) ), (25) 

8=1 
n 

H{Y?\W X ) ~ = ^H{Y x J\Yt 1 ,Y^ 1)i W x ) - H(Y 2i \Yl~ 1 , Y^ i+1 ) ,Wi). (26) 

i=i 

Define auxiliary random variables as 

V i = Yi-\Y% i+1) , i = 1,2,--- ,ti. (27) 

Further define Q to be an auxiliary random variable that is independent of everything else and uniform 
on the set {1, 2, • ■ ■ , n}, and 

V=(V Q ,Q), U=(V,W x ), X 1 = X 1Q , X 2 = X 2Q , Y 1 = Y 1Q , Y 2 = Y 2Q . (28) 

It is straightforward to check that the random variables thus defined satisfy ©. 
Following from (1231 ) and (|26l , we have 

i (H(Y?) - H{Y 2 )) = H{Y X \V) - H(Y 2 \V) (29) 

- {H{Y?\W X ) - H{Y 2 n \W x )) = H{Y X \U) - H{Y 2 \U). (30) 
n 

Notice that (|29| ) implies that there exists a number 7 where 

-H(Y?) = H(Y 1 \V) + 1 (31) 
n 

-H(Y 2 n ) = H(Y 2 \V) + 1 (32) 
n 

< 7 < min {I(V; Y X ),I(V; Y 2 )} (33) 
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where CED follows because H(Y?) < nH{Y x ) and H{Y 2 n ) < nH{Y 2 ) and 

n n 

H{Y?) = H(Y\i\Yi~ l ) > H<y u \Yt\ ^+i)) = nH^V) (34) 



i=l i=l 

Following from (l24l) . we have 



R 



1 1 n 

-H{Y?)--Y^H{Y 2i \X 2i ) + e n 



2 

n ' ~ ' n 

i=i 



= #(F 2 |V) + 7 - ^(^1^2, Q) + e n (35) 

= H(Y 2 \V) + 7 - ^(y 2 |X 2 ) + e n (36) 

< H(Y 2 \V) + min {I(V; Y x ), I(V; Y 2 )} - H(Y 2 \X 2 ) + e n (37) 

= I(X 2 ; Y 2 \V) + min {I(V; Y 1 ),I(V; Y 2 )} + e n (38) 

where (|35l ) follows from (|32l ) and the definition of the random variables in (|28l) ; (1361) follows by 
the memoryless nature of the channel p{y 2 \x 2 ); (1371) follows from (1331) ; and (1381) follows because the 
random variables satisfy C7]) which implies that Markov chain V — > X 2 — > Y 2 holds. 
Following from (l20l) . we have 



Ri < -H(Y») - —H(Y™\Wi) + e n 
n n 

= —H(Y 2 n ) + H(Y 1 \V) - H(Y 2 \V) - -H{Y 2 n \W l ) - H{Y l \U) + H(Y 2 \U) + e n (39) 
n n 

= H{Y X \V) - H{Y 2 \V) - H{Y l \U) + H{Y 2 \U) + e n (40) 
= I(U;Y 1 \V)-I(U;Y 2 \V) + e n (41) 

where d39l) follows from (f29l) and ([30]); (@0l) follows from the fact that Y 2 n only depends on X% and 
the channel noise induced by p(y 2 \x 2 ), and is therefore independent of W±; and (I4TI) follows because 
the random variables satisfy © which implies that Markov chain V — > U — > (Yi, Y 2 ) holds. 
We obtain the desired upper bound on the capacity region from (|38l ) and (I4TT) . 

B. Proof of Theorem |2] 

Since the encoding/decoding procedure follows the standard steps, the detailed calculation of the 
probability of error is omitted. 
Codebook generation: 
Fix a distribution p(v, u, x 2 )p(xi\u, x 2 ). 

The codebook at Transmitter 2 is generated as follows: generate 2™ 7 sequences v n in an i.i.d. 
fashion using p(v). These v n sequences constitute the inner codebook. For each v n , generate 2 n( - R2 ~^ 
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sequences x 2 in an i.i.d. fashion using p(x 2 \v). These x 2 constitute the outer codebook of Transmitter 
2 associated with v n . 

The codebook at Transmitter 1 (the cognitive transmitter) uses the same inner codebook as Transmit- 
ter 2 and the outer codebook of Transmitter 1 is generated as follows: for each v n sequence, generate 
2"(i?i+i?o) sequences u n in an i.i.d. fashion using p(u\v). These u n constitute the outer codebook of 
Transmitter 1 associated with v n . Randomly distribute them into 2 nRl many bins. Each bin will 
contain approximately 2 nR ° many u n sequences. 

Encoding: 

Transmitter 2 splits its message W 2 into two independent parts W 2a and W 2b , with rates 7 and 
R 2 — 7, respectively. For W 2a = w 2a and W 2b = w 2b , it finds the w 2a -th codeword in the inner 
codebook, denoted as v n , and transmits the w 2b -th codeword in the outer codebook (denoted as x 2 ) 
of Transmitter 2 associated with v n . 

The cognitive encoder knows W 2 and therefore knows x 2 and v n . For W\ — wi, it look into the 
Wi-th bin in the outer codebook of Transmitter 1 associated with v n , and find the u n (denoted as u n ) 
that is jointly typical with x 2 conditioned on v n . This can be done almost always as long as 

Ro>I{U-X 2 \V) (42) 

is satisfied. The cognitive encoder then transmit an x\ sequence generated i.i.d. conditioned on u n 
and £2 using p{x\\u, x 2 ). 

Decoding: Receiver 2 first finds the unique v n sequence in the inner codebook that is jointly typical 
with received sequence y% while treating everything else as noise. This can be done if 

1<I(V-Y 2 ) (43) 

Based on the v n sequence it decoded, Receiver 2 then finds the unique x r 2 l sequence that is jointly 
typical with y 2 conditioned on v n in the outer codebook of Transmitter 2 associated with v n . This can 
be done if 

R2-1<I(X 2 ;Y 2 \V) (44) 

Receiver 1 first finds the unique v n sequence in the inner codebook that is jointly typical with 
received sequence y{ 1 while treating everything else as noise. This can be done if 

1 <I(V-Y 1 ) (45) 
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Based on the v n it decoded, Receiver 1 then finds the unique u n sequence that is jointly typical 
with 2/2 conditioned on v n in the outer codebook of Transmitter 1 associated with v n . This can be 
done if 

R + Ro < I{U]Y X \V) (46) 
Based on (l4~2~l)-(l46l). using Fourier-Motzkin elimination, we obtain the desired result. 
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